Search CORE

240 research outputs found

Polyphonic Sound Event Detection by using Capsule Neural Networks

Author: Gabrielli Leonardo
Principi Emanuele
Squartini Stefano
Vesperini Fabio
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 30/01/2019
Field of study

Artificial sound event detection (SED) has the aim to mimic the human ability to perceive and understand what is happening in the surroundings. Nowadays, Deep Learning offers valuable techniques for this goal such as Convolutional Neural Networks (CNNs). The Capsule Neural Network (CapsNet) architecture has been recently introduced in the image processing field with the intent to overcome some of the known limitations of CNNs, specifically regarding the scarce robustness to affine transformations (i.e., perspective, size, orientation) and the detection of overlapped images. This motivated the authors to employ CapsNets to deal with the polyphonic-SED task, in which multiple sound events occur simultaneously. Specifically, we propose to exploit the capsule units to represent a set of distinctive properties for each individual sound event. Capsule units are connected through a so-called "dynamic routing" that encourages learning part-whole relationships and improves the detection performance in a polyphonic context. This paper reports extensive evaluations carried out on three publicly available datasets, showing how the CapsNet-based algorithm not only outperforms standard CNNs but also allows to achieve the best results with respect to the state of the art algorithms

arXiv.org e-Print Archive

IRIS UniversitÃ Politecnica delle Marche

Classification of bearing faults through time-frequency analysis and image processing

Author: Collura Stefano
Rossetti Damiano
Squartini Stefano
Zhang Yu
Publication venue
Publication date: 09/12/2016
Field of study

The present work proposes a new technique for bearing fault classification that combines time-frequency analysis with image processing. This technique uses vibration signals from bearing housings to detect bearing conditions and classify the faults. By means of Empirical Mode Decomposition (EMD), each vibration signal is decomposed into Intrinsic Mode Functions (IMFs). Principal Components Analysis (PCA) is then performed on the matrix of the decomposed IMFs and the important principal components are chosen. The spectrogram is obtained for each component by means of the Short Time Fourier Transform (STFT) to obtain an image that represents the time-frequency relationship of the main components of the analyzed signal. Furthermore, Image Moments are extracted from the spectrogram images of principal components in order to obtain an array of features for each signal that can be handled by the classification algorithm. 8 images are selected for each signal and 17 moments for each image, so an array of 136 features is associated with every signal. Finally, the classification is performed using a standard machine learning technique, i.e. Support Vector Machine (SVM), in the proposed technique. The dataset used in this work include data collected for various rotating speeds and loads, in order to obtain a set of different operating conditions, by a Roller Bearing Faults Simulator. The results have shown that the developed technique provides classification effectively, with a single classifier, of bearing faults characterized by different rotating speeds and different loads

University of Lincoln Institutional Repository

IRIS UniversitÃ Politecnica delle Marche

Exploiting heterogeneous data for the estimation of particles size distribution in industrial plants

Author: Collura Stefano
Rossetti Damiano
Squartini Stefano
Zhang Yu
Publication venue
Publication date: 08/12/2016
Field of study

In industrial environments, it is often difficult and expensive to collect a good amount of data to adequately train expert systems for regression purposes. Therefore the usage of already available data, related to environments showing similar characteristics, could represent an effective approach to find a good balance between regression performance and the amount of data to gather for training. In this paper, the authors propose two alternative strategies for improving the regression performance by using heterogeneous data, i.e. data coming from diverse environments with respect to the one taken as reference for testing. These strategies are based on a standard machine learning algorithm, i.e. the Artificial Neural Network (ANN). The employed data came from measurements in industrial plants for energy production through the combustion of coal powder. The powder is transported in air within ducts and its size is detected by means of Acoustic Emissions (AE) produced by the impact of powder on the inner surface of the duct. The estimation of powder size distribution from AE signals is the task addressed in this work. Computer simulations show how the proposed strategies achieve a relevant improvement of regression performance with respect to the standard approach, using ANN directly on the dataset related to the reference plant

University of Lincoln Institutional Repository

IRIS UniversitÃ Politecnica delle Marche

Adaptive Linear Prediction Filtering in DWT Domain for Real-Time Musical Onset Detection

Author: Francesco Piazza
Leonardo Gabrielli
Stefano Squartini
Publication venue
Publication date: 01/01/2011
Field of study

Springer - Publisher Connector

Directory of Open Access Journals

IRIS UniversitÃ Politecnica delle Marche

Open Access Repository

A Real-Time Speech Enhancement Front-End for Multi-Talker Reverberated Scenarios

Author: Emanuele Principi
Francesco Piazza
Rudy Rotili
Stefano Squartini
Publication venue: 'IntechOpen'
Publication date: 01/01/2012
Field of study

IntechOpen

Crossref

IRIS UniversitÃ Politecnica delle Marche

A Time-Frequency Generative Adversarial based method for Audio Packet Loss Concealment

Author: Aironi Carlo
Cornell Samuele
Serafini Luca
Squartini Stefano
Publication venue
Publication date: 28/07/2023
Field of study

Packet loss is a major cause of voice quality degradation in VoIP transmissions with serious impact on intelligibility and user experience. This paper describes a system based on a generative adversarial approach, which aims to repair the lost fragments during the transmission of audio streams. Inspired by the powerful image-to-image translation capability of Generative Adversarial Networks (GANs), we propose bin2bin, an improved pix2pix framework to achieve the translation task from magnitude spectrograms of audio frames with lost packets, to noncorrupted speech spectrograms. In order to better maintain the structural information after spectrogram translation, this paper introduces the combination of two STFT-based loss functions, mixed with the traditional GAN objective. Furthermore, we employ a modified PatchGAN structure as discriminator and we lower the concealment time by a proper initialization of the phase reconstruction algorithm. Experimental results show that the proposed method has obvious advantages when compared with the current state-of-the-art methods, as it can better handle both high packet loss rates and large gaps.Comment: Accepted at EUSIPCO - 31st European Signal Processing Conference, 202

arXiv.org e-Print Archive

Learning to Rank Microphones for Distant Speech Recognition

Author: Brutti Alessio
Cornell Samuele
Matassoni Marco
Squartini Stefano
Publication venue
Publication date: 01/01/2021
Field of study

Fully exploiting ad-hoc microphone networks for distant speech recognition is still an open issue. Empirical evidence shows that being able to select the best microphone leads to significant improvements in recognition without any additional effort on front-end processing. Current channel selection techniques either rely on signal, decoder or posterior-based features. Signal-based features are inexpensive to compute but do not always correlate with recognition performance. Instead decoder and posterior-based features exhibit better correlation but require substantial computational resources. In this work, we tackle the channel selection problem by proposing MicRank, a learning to rank framework where a neural network is trained to rank the available channels using directly the recognition performance on the training set. The proposed approach is agnostic with respect to the array geometry and type of recognition back-end. We investigate different learning to rank strategies using a synthetic dataset developed on purpose and the CHiME-6 data. Results show that the proposed approach is able to considerably improve over previous selection techniques, reaching comparable and in some instances better performance than oracle signal-based measures

arXiv.org e-Print Archive

Archivio della ricerca - Fondazione Bruno Kessler

Multi-household energy management in a smart neighborhood in the presence of uncertainties and electric vehicles

Author: Emanuele Principi
Luca Serafini
Stefano Squartini
Susanna Spinsante
Publication venue: 'MDPI AG'
Publication date: 01/12/2021
Field of study

none4noThe pathway toward the reduction of greenhouse gas emissions is dependent upon increasing Renewable Energy Sources (RESs), demand response, and electrification of public and private transportation. Energy management techniques are necessary to coordinate the operation in this complex scenario, and in recent years several works have appeared in the literature on this topic. This paper presents a study on multi-household energy management for Smart Neighborhoods integrating RESs and electric vehicles participating in Vehicle-to-Home (V2H) and Vehicle-to-Neighborhood (V2N) programs. The Smart Neighborhood comprises multiple households, a parking lot with public charging stations, and an aggregator that coordinates energy transactions using a Multi-Household Energy Manager (MH-EM). The MH-EM jointly maximizes the profits of the aggregator and the households by using the augmented ɛ-constraint approach. The generated Pareto optimal solutions allow for different decision policies to balance the aggregator’s and households’ profits, prioritizing one of them or the RES energy usage within the Smart Neighborhood. The experiments have been conducted over an entire year considering uncertainties related to the energy price, electric vehicles usage, energy production of RESs, and energy demand of the households. The results show that the MH-EM optimizes the Smart Neighborhood operation and that the solution that maximizes the RES energy usage provides the greatest benefits also in terms of peak-shaving and valley-filling capability of the energy demand.openLuca Serafini, Emanuele Principi, Susanna Spinsante, Stefano SquartiniSerafini, Luca; Principi, Emanuele; Spinsante, Susanna; Squartini, Stefan

Multidisciplinary Digital Publishing Institute

Directory of Open Access Journals

IRIS UniversitÃ Politecnica delle Marche

Genome sequence of Enterococcus mundtii EM01, isolated from Bombyx mori midgut and responsible for flacherie disease in silkworms reared on an artificial diet

Author: Campanaro Stefano
Cappellozza Silvia
de Diego-Diaz Beatriz
Duarte Vinicius da Silva
Saviane Alessio
Squartini Andrea
Treu Laura
Publication venue: 'American Society for Microbiology'
Publication date: 01/01/2018
Field of study

The whole genome sequence of Enterococcus mundtii strain EM01 is reported here. The isolate proved to be the cause of flacherie in Bombyx mori. To date, the genomes of 11 other E. mundtii strains have been sequenced. EM01 is the only strain that displayed active pathological effects on its associated animal species

Crossref

Online Research Database In Technology

Archivio istituzionale della ricerca - Università di Padova